Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
Authors
Abstract
Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods.
Similar resources
Phylogenetic and coalescent strategies of species delimitation in snubnose darters (Percidae: Etheostoma).
The rapid accumulation of multilocus data sets has led to dramatic advances in methodologies for estimating evolutionary relationships among closely related species, but relatively less advancement has been made in methods for discriminating between competing species delimitation hypotheses. Multilocus data sets provide an advantage in testing species delimitation scenarios because they offer a...
A comparative phylogenetic analysis of Theileria spp. by using two "18S ribosomal RNA" and "Theileria annulata merozoite surface antigen" gene sequences
More than 185 species, strains and unclassified Theileria parasites are categorized in the Entrez Taxonomy. The accurate diagnosis and proper identification of the causative agents are important for understanding the epidemiology, prevention and appropriate treatment. This study aims to discuss the importance of two genes of Theileria annulata 18S ribosomal RNA (18S rRNA) and Theileria annulata...
Direct Molecular Detection and Phylogenetic Tree Analysis of Gastrointestinal Protozoan Parasites (Giardia lamblia, Entamoeba histolytica, Cryptosporidium parvum) from Diarrhea Infection in Kut City of Iraq: A Short Communication
Background: The human intestinal tract can be infected by protozoan parasites. In this short communication, stool samples were collected from patients with diarrhea referred to Kut hospital, Iraq, and the parasites (Giardia lamblia, Entamoeba histolytica, Cryptosporidium parvum) were then considered for molecular identification. Methods: Stool samples were collected from 69 patients wit...
Update on HCV genotypes among Iranian blood donors
Background and Objectives: Hepatitis C (HCV) infection is one of the main causes of chronic hepatitis diseases all over the world. HCV is a transfusion transmitted virus and a serious threat to general health. HCV genotyping has an important role in tracing routes of infection. This study aimed at investigating the changes in distribution pattern of HCV genotypes among Iranian blood d...
A weighted least-squares approach for inferring phylogenies from incomplete distance matrices
Motivation: The problem of phylogenetic inference from data sets including incomplete or uncertain entries is among the most relevant issues in systematic biology. In this paper, we propose a new method for reconstructing phylogenetic trees from partial distance matrices. The new method combines the usage of the four-point condition and the ultrametric inequality with a weighted least-squares a...